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UNIFORM  DISTRIBUTIONS  OVER  A  SIMPLEX 


1.  Intrcxiuctlon 


We  are  interested  in  a  random  assignment  of  n  +  1  probabilities 

vrith  p^  +  •••  +  =  1.  By  that  we  mean  that  the  random 

vector  TT  =  tiniformly  distributed  over  the  simplex 

f  ntl  * 

S^^^  =  -{(x^, . . .  ,x^_l_^) ;  >  0,  Z  x^  =  Ij-,  and  this,  in  turn,  has  a 

meaning  we  will  make  precise  in  section  2.  We  will  find  the  joint  distri¬ 
bution  of  the  p's  and  consider  various  ways  of  representing  the  p's 
in  terms  of  familiar  random  variables.  It  turns  out  that  the  joint 
distribution  of  the  p's  is  very  simple  when  given  in  terms  of 
P[p^>  a^,...,p^^^>  rather  than  the  conventional  P[p^<  a^,...,p^^^< 

in  fact,  P[pj^>  a3^»-**»Pn+i>  ^n+l^  ^  "  ^1 . Vl^^* 


The  principal  purpose  of  the  development  is  to  get  Theorem  4>  which 
provides  a  method  for  generating  exponential  (and  hence  normal,  using 
polar  coordinates)  random  variables  in  a  computer  —  tt  is  chosen  from 
then  a  random  variable  z  is  chosen  so  that  zp^,zp2,  •  •  • 

are  independent  and  exponentially  distributed.  We  will  also  point  out 
how  TT  may  be  used  to  produce  points  uniformly  over  the  simplex 
supported  by  an  arbitrary  set  of  n  +  1  linearly  independent  points  in 
space.  Thus  the  main  purpose  of  the  development  is  to  provide  some  theory 
on  which  certain  "Monte  Carlo"  techniques  may  be  based,  but  the  development 
will  also  provide  an  approach  to  problems  involving  xiniform  order 
statistics  which  is  different  from  that  usually  taken. 


2.  Uniform  Pis trlbutl ons 

Let  be  Euclidean  n-space,  and  let  \  be  Lebesgue  measure.  We 

say  that  are  .jointly  vuilformly  distributed,  or  that  the 

vector  C  =  (  •  •  •  >2^^)  is  ttnlformly  distributed ,  over  a  subset  S  C 

vfith  positive  Lebesgue  measure .  if  the  probability  measure  for  C  is 
proportional  to  Lebesgue  measure  —  for  \-measurable  B,  P[( z^, . . . ,z^)eB] 
X(B)/\(S).  Then  z^,...,z^  have  a  joint  density  function;  it  is  constant, 
l/X(S) ,  over  S  and  zero  elsewhere.  If  A  is  the  n  x  n  non-singular 
matrix  of  a  linear  transformation  T,  then  T(C) ,  (=CA.) ,  is  uniformly 
distributed  over  the  image  set  T(S). 


Let  G  be  the  simplex  supported  by  k  linearly  independent  points 

ic  k 

a^,...,aj^  in  R^^:  G  =  {p:  P  =  Sc^^a^,  >  0,  2c^  ”  i]’  We  say  that 

ri  =  (y^, . . .  ,yj^)  is  uniformly  distributed  over  the  simplex  G  if  for 
some  1  the  vector  (y^jy^, . . . ,y^  I’^i+l’ * ' * ’^k^  uniformly  distributed 
in  the  above  sense,  over  the  projection  of  G  onto  ^  formed  by 
suppressing  the  ith  coordinate,  i.e.,  T^(G) ,  where  is  the  trans¬ 
formation  (y^,...,yj^)  - >  +!>•••  >7)^)  taking  Rj^ 

onto  Rj^  This  is  equivalent  to  saying  that  the  probability  that  q 
fall  in  a  subset  H  of  G  is  proportional  to  the  surface  area  of  H. 

For  each  i,  T^(q)  will  be  \iniformly  distributed  in  the  first  sense 
over  Tj^(G)  ,  except  in  the  nuisance  case  when  one  of  the  coordinates, 

say  X.,  of  each  point  ^  =  (x.,,...,x,)  of  G  is  constant;  x.  =  c. 

J  -L  K  J 

Then  only  T.(ri)  will  be  uniformly  distributed. 
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We  note  in  particular  that  if  it  =  is  uniformly- 

distributed  over 


^n+l  =  ^i  ^  ^  ^i  = 

then  each  set  of  n  of  the  p's  is  iiniformly  distributed  over  the 
regular  poly tope 

S^=  X.  >0,  2  x^  <!]. 

If  A  is  the  k  5<  k  non-singular  matrix  of  a  linear  transformation  T, 
then  T(r)),  (=fiA) ,  is  uniformly  distributed  over  the  simplex  T(G). 

This  follows  from  noting  that  T  is  the  product  of  elementary  linear 
transformations,  and  if  yi>‘**>y}c  tiniformly  distributed  over  a 

simplex,  then  ay^,y2, . . .  ,yj^  (a  ^  O) ,  or  y2>y3^  jy^*  •  •  •  >yk 
^1  ^  uniformly  distributed  over  the  appropriately 

transformed  simplex.  We  have  in  particular; 

Theorem  1.  ^  rr  =  (p  ,  ...,p  )  is  uniformly  distributed  over  the 

^  n+1 


simplex  S 


;^^1  =  [(x^,...,x^^^):  Xi  >  0,  2  x^  =  l}  an^  “r*--°n+l 


are  n  +  1  linearly  independent  points  in  then  the  random  vector 

p^a^  +  P2‘^2  Pn+l'^n+l  is  uniformly  distributed  over  G,  the 

n+1  n+1 

simplex  supported  by  the  a's:  G  =  [y*  X  ~  2  c.a.  ,c.  >0,  2  c.  =  l]. 

1  1  1  1 

This  theorem  pro-vldes  a  practical  method  for  producing  random  points 
uniformly  over  regions  with  linear  boundaries,  for  example,  in  certain 
Monte  Carlo  procedures,  and  it  also  shows  how  the  distributions  of  the 
coordinates  of  a  random  viniform  point  from  a  simplex  may  be  found  in 
terms  of  the  distribution  of  a  linear  combination  of  p^,...,p^^^. 


L 
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If  S'l’e  n  linearly  Independent  points  in  and 

(p^,...,p^)  is  imiformly  distributed  over  S*  =  [(x^, . . .  ,x^) :  >  0, 

2  x^  <  l],  then  P-j^Y^  +  **•  +  Pj^Yj^  is  uniformly  distributed  over  the 
set  H  =  [p:  p  =  2  c^Yj^j  >  0,  2  <  l].  Let  Yq  ^>6  the  origin. 

Then  H  may  be  viewed  as  the  convex  polytope  supported  by  Yq>Y-]^>  •  •  •  >Yj^> 


i.e. , 


H=|&;  &  =  CTYn  +'**+cY 
'■  I'l  n'n 


+  c 


n+l'^'o’  °i  ^  ?  °i  “ 


n+1 

2 

1 


Hence,  if  p^  -  Pq,...,P^  -  p^  are  linearly  Independent,  and  if 
TT  =  (P]^>  •  •  •  jPj^’Pn+i^  uniformly  distributed  over 


Vl  = 


n+1 

2x^  =  1}, 


then  p^p^  +  ...  +  =  p^(p^  _  p^)  +  p^(p^  .  p^)  +  ...  + 

^n^^n  ”  ^0^  ^  ^0  ■uniformly  distributed  over  the  convex  hull  of 
Pq,P^, . . . ,P^,  that  set  being  the  translate  of  the  convex  hull  of 

Pq  “  ^O’^l  ~  ^0’’*'’^n  ~  ^0* 

Theorem  2.  ^  n  =  (p^^j •  •  •  is  \miformly  distributed  over  the 

simplex  =  {(x^, . . .  ,x^_l^^) :  x^  >  0,  2  x^  =  l}  and  if  Pq,P^,...,P^ 

are  n  +  1  points  in  such  that  p^  -  p^,  •  •  •  jp^^  -  Pq  are  linearly 

Independent,  then  the  random  vector  P^^Pj  +  •  •  •  +  P^Pj^  +  ^n+l^O  — 
^miformly  distributed  over  H,  the  convex  poly  tope  supported  by  the  p's: 

n+1 

H  =  [«  6  =  djPj  t  ...  +  i  0,  Z  d^  =  1}. 


The  most  common  application  of  this  result  is  in  the  plane  —  to 
choose  a  point  p  from  the  triangle  formed  by  Pq,P^,P2  we  choose 
(Pi,P2>P3)  uniformly  from  and  put  p  =  p^p^  +  P2P2  +  P3Po* 

choose  a  point  uniformly  from  a  polygonal  region  in  the  plane,  we  divide 
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the  region  Into  triangles,  choose  a  triangle  vfith  probability  proportional 
to  its  area,  then  choose  a  point  uniformly  from  that  trieingle. 

If  (p^,...,p^)  is  uniformly  distributed  over  the  polytope 
=5=  {(=^- •••>='»>’  iO,  2,^  <  l] ,  then  the  vector 
(j  =  (p^,p^  +  P2,...,P2  P2  *“  uniformly  distributed  over 

^  =  {(y^j  •  •  •  >yj^)  5  0  <  y^^  <  y2  1  ***  <  y^  ^  hence  the  components 

of  u  are  distributed  as  the  order  statistics  of  independent  uniform 
[0,1]  random  variables  u^,...,u^.  Going  in  the  other  direction,  if 
^(1)  <u^2)  ^  ^ order  statistics  of  n  independent 

uniform  [0,l]  random  variables,  then  the  vector 

(u^^^,u^2)  "  ^(1)  ’  *  *  * ’'^(n)  ~ '^(n  1)^  uniformly  distributed  over  S* 
and  the  vector  (u^)  -  u^^  , . . .  ,U(^)  -%-l)'^  ’  ^(n)^ 

vinlformly  distributed  over 

n+1 

Vl  =  ^i  Sx^  =  l}. 

This  relationship  provides  a  practical  method  for  producing  n  uniformly 
on  ill  ^  computer  —  we  order  a  set  of  n  independent  uniform 

random  variables  and  take  successive  differences. 

We  will  need  the  following  fact:  if  u^,...,u^  are  independent 
random  variables,  each  uniform  on  [0,l],  and  0  <  c  <1,  then 

(1)  P[u^  +  •••  +  u^  <  c]  =  c^n!  . 


This  is  a  special  case  of  the  well-known  result  on  the  distribution  of 
the  sum  of  viniform  random  variables ,  but  it  can  be  proved  by  induction 
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quite  easily,  since 


P[ut+***+  u  <  c]  =  5  P[ut+***+  u  <  c  Iu  =  x]dx  =  i  P[u,+***+  u  ,<  1  -  x]dx. 
■•I  n  ■'  Q  1  n  '  n  ^  1  n-1 

We  note  that  (l)  gives  the  Lebesgue  measure  of  the  region 

(2)  A  =  {(x^,...,x^):  x^  >  0,...,x^  >  0,  x^  +  •••  +  x^  <  c}. 


3 .  The  Joint  Distribution  of  , . . . , 

Theorem  3«  If  tt  =  (p^,...,p^,p  ^  ^  is  uniformly  distributed  over 

n+i“ 

=  [(xj^, . . .  :  x^  >  0,  2  x^  =  l],  then  the  .joint  distribution 

of  the  p's  is ,  for  non -negative  a^, . . .  ,a^_^^: 

P[Pl  >  a^,P2  >  a2,...,p^  >  a^,p^^^  >  =  (l  -  a^  -  a^ . 

for  a^  +  •  •  •  +  a^^^  ^  1  and  zero  for  a^  +  •  •  •  +  a^_^^  >  1 . 

The  proof  follows  readily  from  the  fact  that  the  two  sets 
A=  {(x^,...,x^):x^  >0,...,x^>0,x^+---+x^<cj 

B  =  [(y^,...,y^);y^-  a^>  0,...,yj^-  a^>  0,(y^-  a^)+(y2-  '  *'^^yn“  ^n^ 

are  congruent,  and,  by  (2)  and  the  remark  preceding  (2),  their  common 
Lebesgue  measure  is  c^/nl,  for  0  <  c  <1.  Then 


P[Pl>  ^i>‘*-»Pn+i>  ^n+1^  P[Pl>  a^,...,Pj^>  a^,l  -  P^^  - 


P  >  a  ] 
*^n  n-* 


=  P[Pl-  ai>  0,...,p^-  a^>  0,(p^-  a^)  +  ...+(p^-  aj<  1  -  a^  -  •  •  •  -  a^^^] , 
The  latter  is  the  probability  that  (p^^,  •  •  •  ,Pjj)  will  fall  in  B,  with 


c  =  1  -  a. 


-  a^  -  and  hence  equals  X(B)/X(S*)  =  c*^. 
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Putting  the  appropriate  a's  equal  to  zero,  we  have 

Corollary.  If  n  =  (P]_>  •  •  •  jP^’^n+l^  Is  uniformly  distributed  over 
then  the  .joint  distribution  of  p^, . . .  ,p^,(k  <  n) ,  Is  given  by 

P[P2>  aj^]  =  (1  -  a^  -  a^  -  •••  -  a^>  0,a^  +  •••  +  a^^  <  1, 

and,  taking  partial  derivatives,  the  .joint  density  of  p^ , . . . ,  p^^  is 

n(n+l)  •  • -(n-k+l)  (l-x^-*  •  for  >  0,  x^+*  *  •+  Xj^  <  1 

and  zero  elsewhere. 

We  now  get  this  interesting  result: 

Theorem  4.  ^  tt  =  (p^, . . .  is  uniformly  distributed  over 

n+l^ 

=  {(x^, , . .  ,x^_l_^)  :  >  0,  2  x^  =  l],  and  if  z  has  density 

function  x^e~^/n!,  x  >  0,  and  is  independent  of  tt,  then  the  random 
variables 


j  ^P2  >  •  ♦  •  >  ^P^i+l 


are  independent  and  exponentially  distributed,  (density  e"^,  x  >  O) . 

The  proof  comes  from  integrating  the  conditional  probability  for 
given  z ; 


n  -X 
X  e 


P[zp^>  b^,...,zp^^^>  b^^^]  =  5  p[zp^>  b^,...,zp^^^>  b^^Jz  =  x]  ^ 


n  -X  00 


“  b, 

S 

b-,+***+b  ^ 


Vl."  x^e-^. 
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as  the  order  statistics  of  n  independent  \miform  [0,1]  random  variables. 
Now  let  k  be  given,  1  <  k  ^  n.  Then  v^,...,Vj^  ^  may  be  written 


yi+**-^yv  k’ 


yi+-  • 


€ind  •  •  •  »''^n  written 


V,  +  (1-v.  )  - r- - 

^  yk+i-^'-'-^n+i 


.  X  ^k+l‘^k+2 

>  V  +  (1-Vj^)  +...4V  ’ 

^k+l  ^n+1 

,  „  s  W'--% 
\  ^  y — +:v.w — ^ 

^k+1  ^n+1 


and  it  is  obvious  that,  for  given  Vj^,  the  random  variables  in  (5)  are 
distributed  as  the  order  statistics  of  k-1  uniform  variables  on  the 
interval  [0,Vj^],and  are  independent  of  the  variables  in  (6),  which  are 
distributed  as  the  order  statistics  of  (n-k)  uniform  variables  on  the 
interval  [Vj^,l]. 

One  may  prove  directly  that  each  of  >  •  •  •  > ^  beta  distri¬ 
bution,  but  an  interesting  alternative  method  is  to  take  advantage  of 
known  facts  concerning  the  ratios  of  quadratic  forms  in  normal  variables, 
as  each  of  the  v's  may  be  viewed  as  such.  The  relationship  between 
the  beta  and  the  F  distributions  may  be  brought  in  at  this  point,  too. 


i 


